Designing Training Sample Size for Support Vector Machines Based on Kinematic Gait Data
نویسندگان
چکیده
INTRODUCTION Traditionally, biomechanical gait data are analyzed using discrete variables and a univariate statistical approach. In contrast, the multivariate and complex pathoaetiology of a running-related musculoskeletal injury often demands a more robust analysis [1]. Support Vector Machines (SVMs) have recently been applied to analyze biomechanical data due to its ability to identify complex associations amongst many discrete gait variables [2]. However, the number of observations required to train the SVM classifier to solve biomechanics classification problems is still unknown. For this reason, generic rule-of-thumb approaches have been used suggesting that a very large sample size is required to provide appropriate training for optimal classification. Unfortunately in clinical studies, large samples are usually not readily available. In addition, large sample sizes may be wasteful when fewer samples may result in optimal classification. Furthermore, a large sample size does not guarantee that one will obtain optimal classification since the groups may considerably overlap in their distribution, even in high-dimensional space. Therefore, it is paramount to a priori approximate the sample size in biomechanical studies. Hence, the aim of this study is to provide guidance to plan the appropriate sample size for SVM training classification based on discrete kinematic gait variables.
منابع مشابه
Separating Well Log Data to Train Support Vector Machines for Lithology Prediction in a Heterogeneous Carbonate Reservoir
The prediction of lithology is necessary in all areas of petroleum engineering. This means that to design a project in any branch of petroleum engineering, the lithology must be well known. Support vector machines (SVM’s) use an analytical approach to classification based on statistical learning theory, the principles of structural risk minimization, and empirical risk minimization. In this res...
متن کاملApplication of Artificial Neural Networks and Support Vector Machines for carbonate pores size estimation from 3D seismic data
This paper proposes a method for the prediction of pore size values in hydrocarbon reservoirs using 3D seismic data. To this end, an actual carbonate oil field in the south-western part ofIranwas selected. Taking real geological conditions into account, different models of reservoir were constructed for a range of viable pore size values. Seismic surveying was performed next on these models. F...
متن کاملFace Recognition using Eigenfaces , PCA and Supprot Vector Machines
This paper is based on a combination of the principal component analysis (PCA), eigenface and support vector machines. Using N-fold method and with respect to the value of N, any person’s face images are divided into two sections. As a result, vectors of training features and test features are obtain ed. Classification precision and accuracy was examined with three different types of kernel and...
متن کاملUsing Support Vector Machines to Evaluate Financial Fate of Dotcoms
The success of dotcoms has been short lived. Within a short span of a few years the dotcoms experienced a meteoric rise and suffered a dramatic fall. This research is a support vector machine based investigation of dotcoms from a financial perspective. Data from the financial statements of survived and failed dotcoms are collected and 24 financial ratios are computed. The financial ratios are a...
متن کاملAutomatic Interpretation of UltraCam Imagery by Combination of Support Vector Machine and Knowledge-based Systems
With the development of digital sensors, an increasing number of high-resolution images are available. Interpretation of these images is not possible manually, which necessitates seeking for practical, fast and automatic solutions to solve the environmental and location-based management problems. The land cover classification using high-resolution imagery is a difficult process because of the c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012